@mrDzurb mrDzurb commented Jun 29, 2025

Description

This PR consolidates the Group Multi-Model Deployment and Stacked Model Deployment features into the main branch.
The combined work enables deployment of multiple models (base + fine-tuned) on a single container instance, introduces validation and configuration updates, and includes comprehensive SDK, UI, and CLI enhancements to support both deployment and inferencing for stacked model groups.


Key Deliverables

1. Group Model Deployment

  • Added multi-model deployment configuration support in AQUA SDK, UI, and CLI.
  • Implemented logic to launch and manage multiple model instances in a single container via MD Groups.
  • Introduced backend validation for compatible shapes, GPU allocations, and container configurations.
  • Integrated multi-model evaluation and inferencing support.
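For reviewers, the GPU allocation check mentioned above could look roughly like the sketch below. It is a minimal illustration, not the code in this PR: `ModelSpec`, `validate_gpu_allocation`, and the `shape_gpu_total` parameter are hypothetical names, and the real validator may also consider shape families and container configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical per-model entry in a multi-model deployment request."""
    name: str
    gpu_count: int


def validate_gpu_allocation(models: list[ModelSpec], shape_gpu_total: int) -> list[str]:
    """Return human-readable problems; an empty list means the request fits."""
    errors: list[str] = []
    if not models:
        errors.append("at least one model is required")

    seen: set[str] = set()
    for m in models:
        if m.gpu_count < 1:
            errors.append(f"{m.name}: gpu_count must be >= 1")
        if m.name in seen:
            errors.append(f"{m.name}: duplicate model name")
        seen.add(m.name)

    # The sum of per-model GPU requests must not exceed what the shape offers.
    requested = sum(m.gpu_count for m in models)
    if requested > shape_gpu_total:
        errors.append(
            f"requested {requested} GPUs but the shape provides {shape_gpu_total}"
        )
    return errors
```

Returning a list of problems instead of raising on the first one lets the SDK, UI, and CLI surface every issue in a single pass.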

2. Stacked Model Deployment

  • Introduced Stacked Model Deployment for base + fine-tuned LoRA weights.
  • Added dynamic merging of adapter weights in the vLLM SMC container.
  • Implemented migration and compatibility logic for versioned fine-tuned models (FT-V1 → FT-V2).
  • Extended AQUA SDK/CLI to support stacked deployment lifecycle: create, edit, and infer.
  • Added a validator that detects the fine-tuned model version and applies the correct deployment path.
  • Implemented “Zero-Time Update” mechanism for fine-tuned weight management via CLI.
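To illustrate the version-based routing described above, here is a hedged sketch of how a validator might map a fine-tuned model's version to a deployment path. Everything in it is an assumption made for illustration: the `ft_version` metadata key, the `select_deployment_path` function, and the routing table are hypothetical, and the PR does not state which version maps to which path.

```python
def select_deployment_path(metadata: dict[str, str], routes: dict[str, str]) -> str:
    """Pick a deployment path from a caller-supplied version -> path table.

    `metadata` stands in for the fine-tuned model's metadata; the
    'ft_version' key is a hypothetical tag name, not a documented field.
    """
    version = metadata.get("ft_version", "").strip().lower()
    if not version:
        raise ValueError("model metadata carries no fine-tune version tag")
    try:
        return routes[version]
    except KeyError:
        # Fail loudly rather than silently falling back to a default path.
        raise ValueError(f"unsupported fine-tune version: {version!r}") from None


# Illustrative routing table only; the actual mapping lives in the PR's code.
example_routes = {"v1": "legacy", "v2": "stacked"}
print(select_deployment_path({"ft_version": "V2"}, example_routes))
```

Passing the routing table in as an argument keeps the version-to-path mapping out of the function, so adding a new version would not require changing the function itself.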

Testing

  • Unit and integration tests added
  • E2E tests covering both deployment and inferencing paths

@oracle-contributor-agreement oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Jun 29, 2025
@github-actions

📌 Cov diff with main:

Coverage-0%

📌 Overall coverage:

Coverage-18.64%

github-actions bot commented Jul 7, 2025

📌 Cov diff with main:

Coverage-0%

📌 Overall coverage:

Coverage-18.63%

github-actions bot commented Jul 7, 2025

📌 Cov diff with main:

Coverage-3%

📌 Overall coverage:

Coverage-18.63%

@github-actions

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-18.62%

@github-actions

📌 Cov diff with main:

Coverage-1%

📌 Overall coverage:

Coverage-18.47%

@github-actions

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

@github-actions

📌 Cov diff with main:

Coverage-68%

📌 Overall coverage:

Coverage-58.39%

@github-actions

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

lu-ohai previously approved these changes Oct 30, 2025
@github-actions

📌 Cov diff with main:

Coverage-69%

📌 Overall coverage:

Coverage-58.39%

github-actions bot commented Nov 4, 2025

⚠️ This PR changed pyproject.toml file. ⚠️

  • PR Creator must update 📃 THIRD_PARTY_LICENSES.txt, if any 📚 library added/removed in pyproject.toml.
  • PR Approver must confirm 📃 THIRD_PARTY_LICENSES.txt updated, if any 📚 library added/removed in pyproject.toml.

github-actions bot commented Nov 4, 2025

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

github-actions bot commented Nov 4, 2025

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

@mrDzurb mrDzurb changed the title from "[INT TEST][AQUA] Group Model Deployment" to "[Ready To Merge][AQUA] Group Model Deployment" Nov 6, 2025
lu-ohai previously approved these changes Nov 6, 2025
github-actions bot commented Nov 6, 2025

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

@mrDzurb mrDzurb requested a review from lu-ohai November 6, 2025 21:47
@mrDzurb mrDzurb changed the title from "[Ready To Merge][AQUA] Group Model Deployment" to "[Ready To Merge][AQUA] Group Model Deployment & Stacked Model Deployment" Nov 6, 2025
github-actions bot commented Nov 6, 2025

📌 Cov diff with main:

Coverage-2%

📌 Overall coverage:

Coverage-17.91%

@mrDzurb mrDzurb merged commit 26cda04 into main Nov 6, 2025
22 of 24 checks passed